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We study the statistics of the number of records R n ,N for N identical and independent symmetric 
discrete-time random walks of n steps in one dimension, all starting at the origin at step 0. At 
each time step, each walker jumps by a random length drawn independently from a symmetric and 
continuous distribution. We consider two cases: (I) when the variance a 2 of the jump distribution 
is finite and (II) when a 2 is divergent as in the case of Levy flights with index < fi < 2. In 
both cases we find that the mean record number (7?n,jv) grows universally as ~ ctN\/n for large n, 
but with a very different behavior of the amplitude ajv for TV > I in the two cases. We find that 
for large N, qjv ~ 2y / log N independently of a 2 in case I. In contrast, in case II, the amplitude 
approaches to an TV-independent constant for large TV, ajv w i/y/w, independently of < fj, < 2. 
For finite a 2 we argue, and this is confirmed by our numerical simulations, that the full distribution 
of (Rn,N I \fn — 2-^log iV)^/log N converges to a Gumbel law as n — > oo and N — > oo. In case II, 
our numerical simulations indicate that the distribution of R n ,N/\/n converges, for n — > oo and 
TV" — > oo, to a universal nontrivial distribution, independently of p. We discuss the applications of 
our results to the study of the record statistics of 366 daily stock prices from the Standard & Poors 
500 index. 
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I. INTRODUCTION 

A record is an entry in a series of events that exceeds all previous entries. In recent years there has been a 
surge of interest in the statistics of record-breaking events, both from the theoretical point of view as well as in 
multiple applications. The occurrence of record-breaking events has been studied for instance in sports 0, in 
evolution models in biology [!, 13], in the theory of spin-glasses d, H[ and in models of growing networks @- Recently 
there has been some progress in understanding the phenomenon of global warming via studying the occurrence of 
record-breaking temperatures . 

More precisely, let us consider a sequence or a discrete-time series of random variables {x(0), x(l), x(2), . . . ,x(n)} 
with n + 1 entries. This sequence may represent for example the daily maximum temperature in a city or the daily 
maximum price of a stock. A record is said to happen at step m if the m-th member of the sequence is bigger than 
all previous members, i.e., if x(m) > x(i) for all i — 0,1,2,..., (m — 1). Let R n denote the number of records in 
this sequence of n + 1 entries. Clearly R n is a random variable whose statistics depends on the joint distribution 
of P(x(0), x(l), . . . , x(n)) of the members of the sequence. When the members of the sequence are independent 
and identically distributed (i.i.d) random variables each drawn from a distribution p(x), i.e., the joint distribution 
factorizes, P(x(0),x(l), . . . ,x(n)) = Y\a=i P( x (i))i the record statistics is well understood from classical theories [T^ — 
In particular, when p(x) is a continuous distribution, it is known that the distribution of record number P(R n , n) 
is universal for all n, i.e., independent of the parent distribution p(x). The average number of records up to step n, 
(R n ) = ~Y^m=i l/ m f° r all n and the universal distribution, for large n, converges to a Gaussian distribution with 
mean rj ln(n) and variance « Inn. 

While the statistical properties of records for i.i.d random variables (RV's) are thus well understood for many years, 
numerous questions remain open for more realistic systems with time-dependent or correlated RV's. In principle there 
are many different ways to generalize the simple i.i.d. RV scenario described above. For instance, one can consider 
time series of RV's that are independent, but not identically distributed. One example for this case is the so called 
Linear Drift Model with RV's from probability distributions with identical shape, but with a mean value that increases 
in time. This model was first proposed in the 1980's [l5| and was recently thoroughly analyzed in Refs. [l6l - [l8j . In 
2007 Krug also considered the case of uncorrelated RV's from distributions with increasing variance 

Another possible generalization is the one where RV's are correlated. Perhaps, the simplest and the most 
natural model of correlated RV's is an n-step one dimensional discrete-time random walk with entries {x(0) = 
0, x(l), x(2), . . . , x(n)} where the position x(m) of the walker at discrete time m evolves via the Markov jump process 

x(m) = x(m — 1) + 77(771) , (1) 

with x(0) = and 77(777,) represents the random jump at step m. The noise variables 77(m)'s are assumed to be 
i.i.d variables, each drawn from a symmetric distribution 7(77). For instance, it may include Levy flights where 
f(rj) ~ |*?| — for large 77 with the Levy index < /j < 2 which has a divergent second moment. Even though this 
model represents a very simple Markov chain, statistical properties of certain observables associated with such a walk 
may be quite nontrivial to compute, depending on which observable one is studying fl9l - |2"lj ]. For instance, in recent 
years there has been a lot of interest in the extremal properties of such random walks. These include the statistics 
of the maximal displacement of the walk up to n steps with several applications [2l| - [25| and the order statistics, i.e., 
the statistics of the ordered maxima [26l |27| as well as the universal distribution of gaps between successive ordered 
maxima of a random walk [27| . 

The statistics of the number of record-breaking events in the discrete-time random walk process in Eq. (JT|) has 
also been studied in a number of recent works with several interesting results |28l432| . In 2008, Majumdar and Ziff 
computed exactly the full distribution P(R n ,n) of the record number up to n steps and found that when the jump 
distribution f(rf) is continuous and symmetric, the record number distribution P(R n ,n) is completely universal for 
all n, i.e., independent of the details of the jump distribution 28]. In particular, for instance, the Levy flight with 
index < /1 < 2 (thus with a divergent second moment of the jump distribution /(?7)) has the same record number 
distribution as for a Gaussian walk (with a finite second moment of f(ri)). This is a rather amazing result and the 
deep reason for this universality is rooted [28| in the so called Sparre Andersen theorem (34} . In particular, for large 
77, P(R n ,n) ~ n~ 1 l 2 G(R n / \/n) where the scaling function G(x) = e~ x2 / 4 / \/n is universal 28]. The mean number of 
records (R n ) ~ An/n for large n (28J. In contrast, this universal result does not hold for symmetric but discontinuous 
/(?7). For example, if f(r]) = ^S(rj — 1) + \ S(t) + 1), then x m represents the position of a random walker at step m on 
a 1-d lattice with lattice spacing 1. In this case, the mean number of records still grows as y/n for large n but with a 
smaller prefactor, (R n ) w y/2n/ir [28j |. 

These results were later generalized to several interesting cases, for instance, to the record statistics of one dimen- 
sional random walk in presence of an external drift [2^. [30| and one dimensional continuous-time random walk with 
a waiting-time distribution between successive jumps [31J. The record statistics of the distance traveled by a random 
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walker in higher dimensions with and without drift has been studied numerically in the context of contamination 
spread in porous medium [32j . In [30( , it was also found that the record statistics of stock markets is very similar to 
the ones of biased random walks. 

While in Refs. [28|-[32j the record statistics of a single discrete-time random walker was studied, the purpose of this 
article is to generalize these results to the case where one has N independent one dimensional discrete-time random 
walks. In this V-walker process, a record happens at an instant when the maximum position of all the walkers at 
that instant exceeds all its previous values. We will see that despite the fact that the walkers are independent, the 
record statistics is rather rich, universal and nontrivial even in this relatively simple model. 

Let us first summarize our main results. We derive asymptotic results for the mean of the record number (R n ,N) 
up to a time n and also discuss its full distribution. It turns out that for N > 1, while the full universality with 
respect to the jump distribution found for N = 1 case is no longer valid, there still remains a vestige of universality 
of a different sort. In our analysis, it is important to distinguish two cases: case (I) where the jump distribution f(rj) 
has a finite variance a 2 — J_ rj 2 f(rj) dn and case (II) where a 2 is divergent as in the case of Levy flights with Levy 
index < /i < 2. In both cases, we find that the mean record number (R n ,N) grows universally as ~ a^y/n for large 
n. However, the N dependence of the prefactor a^, in particular for large N, turns out to be rather different in the 
two cases 

!2\/log N in Case I (independent of a 2 ) 
(2) 
4/- v /7r in Case II (independent of/x) 

In addition, we also study the distribution of the record number R n ,N- For finite a 2 we argue and confirm numerically 
that the distribution of the random variable (R n ,N I y/n— 2 y^og N) %/log N converges to the Gumbel law asymptotically 
for large n and ./V (see section II for details). In contrast, in case II, we find numerically that the distribution of 
Rn,N/Vn converges, for large n and N, to a nontrivial distribution independent of the value of < /i < 2 (see 
section II for details). We were however unable to compute this asymptotic distribution analytically and it remains 
a challenging open problem. Finally, we discuss the applications of our results to the study of the record statistics of 
366 daily stock prices from the Standard & Poors 500 index (33[. We analyze the evolution of the record number in 
subsets of N stocks that were randomly chosen from this index and compare the results to our analytical findings. 
While the strong correlations between the individual stocks seem to play an important effect in the record statistics, 
the dependence of the record number on N still seems to be the same as in the case of the N independent random 
walkers. 

The rest of the paper is organized as follows. In section II, we define the TV-walker model precisely and summarize 
the main results obtained in the paper. In section III, we present the analytical calculation of the mean number of 
records for multiple random walkers, in both cases where a 2 is finite (case I) and a 2 is infinite (case II). Section IV is 
devoted to an analytic study of the distribution of the record number in the case where a 2 is finite. In section V we 
present a thorough numerical study of the record statistics of multiple random walks, and in section VI we discuss the 
application of our results to the record statistics of stock prices. Finally, we conclude in section VII and present the 
technical details of some of the analytical computations concerning the computation of the mean number of records 
and the distribution of the record number for lattice random walks in the three Appendices A, B and C 



II. RECORD STATISTICS FOR MULTIPLE RANDOM WALKS: THE MODEL AND THE MAIN 

RESULTS 

Here we consider the statistics of records of N independent random walkers all starting at the origin 0. The position 
Xi{m) of the i-th walker at discrete time step m evolves via the Markov evolution rule 

Xi(m) = Xi(m - 1) + r)i(m) , (3) 

where 2^(0) = for all i = 1, 2, . . . , N and the noise %(m)'s are i.i.d variables (independent from step to step and 
from walker to walker), each drawn from a symmetric distribution f(r)). We are interested in the record statistics of 
the composite process. More precisely, consider at each step m, the maximum position of all N random walkers 

imax(m) = max[xi(rn),x 2 (m), . . . ,x N (m)] . (4) 

A record is said to happen at step m if this maximum position at step m is bigger than all previous maximum positions, 
i.e. if x ma x(w) > x max {k) for all k = 0, 1, . . . , (m — 1) (see Fig. [T]). In other words, we are interested in the record 
statistics of the stochastic discrete-time series {a; max (m)}, with the convention that the initial position a: max (0) = is 
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Figure 1. Schematic trajectories of -/V = 3 random walkers. Each walker starts at the origin and evolves via the Markov 
jump process in Eq. (|3]). A record happens at step m if the maximum position at step m Xraa^ijn) > a; max (fc) for all 
k = 0, 1, 2, . . . (m — 1). The record values are shown by filled circles. 



counted as a record. Note that even though the position of each walker evolves via the simple independent Markovian 
rule in Eq. ([3]), the evolution of the maximum process {x max (m)} is highly non-Markovian and hence is nontrivial. 

Let R n .N denote the number of records up to step n for this composite iV-walker process. Clearly R n .N is a random 
variable and we are interested in its statistics. For a single walker N — 1, we have already mentioned that the 
probability distribution of the record number R nt i is completely universal, i.e., independent of the jump distribution 
f(rf) as long as f(rf) is symmetric and continuous 28]. In particular, for example, the record number distribution is 
the same for simple Gaussian walkers as well for Levy flights with index < /i < 2. Here we are interested in the 
opposite limit when N — > oo. 

We find that while the complete universality of the record statistics is no longer true for N > 1, a different type of 
universal behavior emerges in the N — > oo limit. In this large N limit, there are two universal asymptotic behaviors 
of the record statistics depending on whether the second moment a 2 = rj 2 f(rj) dr\ of the jump distribution is 
finite or divergent. For example, for Gaussian, exponential, uniform jump distributions a 2 is finite. In contrast, for 
Levy flights where f(rj) ~ \r)\~ for large r\ with the Levy index < /i < 2, the second moment a 2 is divergent. In 
these two cases, we find the following behaviors for the record statistics. 

Case I (a 2 finite): In this case, we consider jump distributions f(r]) that are symmetric with a finite second moment 
a 2 = f_ rj 2 f(rf) drj. In this case, the Fourier transform of the jump distribution f(k) — f(rj) e %kr> dr\ behaves, 
for small k, as 

/» « 1 - y k 2 + . . . (5) 

Examples include the Gaussian jump distribution, f(r]) — \fapK e~ av , exponential jump distribution f(rj) = 
(b/2) exp[— b\r]\], uniform jump distribution over [—/,/] etc. For such jump distributions, we find that for large 
number of walkers N, the mean number of records grows asymptotically for large n as 



(Rn,N) 2 Vh^N ^ ■ (6) 

N— >oo 

Note that this asymptotic behavior is universal in the sense that it does not depend explicitly on a as long as a is 
finite. 
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Moreover, we argue that for large N and large n, the scaled random variable R n .N / \fn converges, in distribution, 
to the Gumbel form, i.e, 



Prob. 



Rn. 



N 



< X 



N-yoa 



x-2VlnN) VhiN 



where F\{z) = cxp [— cxp[— z]] 



(7) 



Indeed, for large N and large n, the scaled variable R n ,N/\/n converges, in distribution, to the maximum of TV 
independent random variables 



Rn. 



N n— > oo 



y/n N- 



> M 



N 



where M N = max(j/i, y 2 , ■ 



,Vn) 



(8) 



where yt > O's are i.i.d non-negative random variables each drawn from distribution p(y) — e y / 4 for y > and 
p(y) = for y < 0. 

Case II (ex 2 divergent ): In this case we consider jump distributions f(rj) such that the second moment a 1 is 
divergent. In this case, the Fourier transform f(k) of the noise distribution behaves, for all k, as 



f(k) = 1-10*1" + ... 



(9) 



where < /i < 2. Examples include Levy flights where f{rf) ~ with the Levy index < /i < 2. For the noise 

distribution in Eq. we find, quite amazingly, that in the large N and large n limit, the record statistics is (i) 
completely universal, i.e., independent of /i and a (ii) more surprisingly and unlike in Case-I, the record statistics also 
becomes independent of N as N — > oo. For example, we prove that for large N, the mean number of records grows 
asymptotically with n as 



{Rn.N) > — 7= Vn , 

JV->oo \/TT 



(10) 



which is exactly twice that of one walker, i.e., {Rn,N-nx>) = 2 {R n ,i) for large n. Similarly, we find that the scaled 
variable Rn,N /-\/n, for large n and large N, converges to a universal distribution 



Prob. 



Rr. 



.N 



< X 



N- 



-> F 2 (x) 



(11) 



which is independent of the Levy index fi as well as of the scale a in Eq. ^ . While we have computed this universal 
distribution F%{x) numerically rather accurately, we were not able to compute its analytical form. 



III. MEAN NUMBER OF RECORDS FOR MULTIPLE WALKERS 



Let R ni N be the number of records up to step n for N random walkers, i.e., for the maximum process x max (n). Let 
us write 

Rm,N — Rm—1,N + £m,N j (12) 

where Cm,N is a binary random variable taking values or 1. The variable £ m .Ar = 1 if a record happens at step m 
and £,m,N = otherwise. Clearly, the total number of records up to step n is 

n 

Rn,N — 2_, €m,N ■ (13) 
m=l 

So, the mean number of records up to step n is 

n n 

{Rn,N) = ^ (€ m , N ) = X! Tm ' N ' ^ 14 ) 
m— 1 m— 1 

where r m jy = {£ m ,N) is just the record rate, i.e., the probability that a record happens at step m. To compute the 
mean number of records, we will first evaluate the record rate r m ,N and then sum over m. 



Figure 2. A record happens at step m with record value x for N = 3 walkers, all starting at the origin. This event corresponds 
to one walker (the dashed line) reaching the level x for the first time at step m while the other walkers stay below the level x 
up to step m. 

To compute r mj j\r at step m, we need to sum the probabilities of all trajectories that lead to a record event at step 
to. Suppose that a record happens at step m with the record value x (see Fig. [5]). This corresponds to the event that 
one of the N walkers (say the dashed trajectory in Fig. [2]), starting at the origin at step 0, has reached the level x for 
the first time at step m, while the rest of the N — 1 walkers, starting at the origin at step 0, have all stayed below the 
level x till the step to. Also, the walker that actually reaches x at step m can be any of the N walkers. Finally this 
event can take place at any level x > and one needs to integrate over the record value x. Using the independence 
of N walkers and taking into account the event detailed above, one can then write 

/>oo 

r m .N = N I p m (x) [qmix)} 1 *^ 1 dx , (15) 
Jo 

where q m (x) denotes the probability that a single walker, starting at the origin, stays below the level x up to step m 
and p m (x) is the probability density that a single walker reaches the level x for the first time at step m, starting at 
the origin at step 0. 

The two quantities p m (x) and q m (x) can be reinterpreted in terms of slightly more familiar objects via the following 
observation. Note that by shifting the origin to the level x and using the time-reversal property of the trajectory of 
a single random walker, it is easy to see that p m (x) is just the probability density that a single walker, starting at 
the origin at step 0, reaches x at step m while staying positive at all intermediate steps. By a similar shift of the 
origin to level x and using the reflection symmetry of the trajectories around the origin, it is clear that q m (x) can 
be interpreted as the probability that a single walker, starting at an initial position x > at step 0, stays positive 
(i.e., does not cross the origin) up to step to. This is then the familiar persistence or the survival probability of a 
single random walker [p. In fact, both these quantities p m (x) and q m {x) can be regarded as special cases of the 
more general restricted Green's function in the following sense. Consider a single random walker starting at position 
x at step and evolving its position via successive uncorrelated jumps as in Eq. ((T|). Let G+(y,x,m) denote the 
probability density that the walker reaches y > at step to, starting at x > at step 0, while staying positive at 
all intermediate steps. The subscript + denotes that it is indeed the restricted Green's function counting only the 
trajectories that reaches y at step m without crossing the origin in between. It is then clear from our discussion above 
that 

p m {x) = G+(x,0,m) (16) 

p OO 

q m (x) = / G+(y,x,m)dy . (17) 
Jo 

In the second line, the survival probability q m (x) is obtained from the restricted Green's function by integrating over 
all possible final positions of the walker. Note also, from Eqs. (|16l) and (fTT|) . that the survival probability starting 
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exactly at the origin is 



q m (0) 



p m {x) dx. 



(18) 



Hence, if we know the restricted Green's function G+(j/,x, m), we can in principle compute the two required 
quantities p m [x) and q m (x). Using the Markov evolution rule in Eq. <jTJ) , it is easy to see that the restricted Green's 
function G+(y, x, m) satisfies an integral equation in the semi-infinite domain [2l| 



G+(y,x,m) 



G + (y',x,m- 1) f(y-y')dy' 



(19) 



starting from the initial condition, G + (y, x, 0) = 5(y — x). Such integral equations over the semi-infinite domain are 
called Wiener-Hopf equations and are notoriously difficult to solve for arbitrary kernel f{z). Fortunately, for the case 
when f(z) represents a continuous and symmetric probability density as in our case, one can obtain a closed form 
solution for the following generating function (rather its Laplace transform) (35j | 



dye 



->>v 



where 



(s, A) = exp 



dx e 



-An x 



oo 

E 

m=0 



G+(y, x, m) s r ' 



G(X,X ,s) 



(s,X)(j>(s,\ ) 

A + An 



ln[l - sf(k)} 
X 2 + k 2 



dk 



and f(k) = f{x)e lkx dx 



(20) 



(21) 



While the formula in Eq. (|20j) is explicit, it is rather cumbersome and one needs further work to extract the 
asymptotic behavior of p m (x) and q m (x) from this general expression. To make progress, one can first make a change 
of variable on the left hand side (lhs) Anx = z and then take the An — > oo limit. Using <f>(s, An — > oo) = I and the 
definition G+(y, 0, m) = p m (y), and replacing y by x we then obtain the following relation 



m=0 



p m (x)e Xx dx = 4>(s, A) 



(22) 



where <fi(s,\) is given in Eq. (|2T1) . Similarly, putting A = on the lhs of Eq. (l20l) . using the definition q m (x) 
J °° G+(y, x, m) dy and replacing An by A, it is easy to see that 



E sm 

m=Q 



q m {x)e Xx dx 



AVI 



:<j>(s,\). 



(23) 



The formula in Eq. (|23j) is known in the literature as the celebrated Pollaczek-Spitzer formula 1 36, 37] and has been 
used in a number of works to derive exact results on the maximum of a random jump process [23l |38M40| . Interestingly, 
this formula has also been useful to compute the asymptotic behavior of the flux of particles to a spherical trap in 
three dimensions [241 EH. E2I] . 

Let us also remark that by making a change of variable Xx = y on the lhs of Eq. (|23p and taking A — > 00, one 
obtains the rather amazing universal result for all m 



00 

E 

m=0 



q m (0) S r ' 



1 



9m (0) 



2m 
m 



I 

2~2™ 



(24) 



which is known as the Sparre Andersen theorem [34|. In particular, for large m, q m (fi) ~ l/^/mn. note that for the 
case of a single walker N = 1, it follows from Eq. (Tf5|) that the record rate at step m is simply given by 



<m,l — 



p m (x)dx = q m (0) = -3- 



2m 
rn 



I 



I 



(25) 



where we have used Eq. (| 18[) and the Sparre Andersen theorem (|24[) . Thus, one obtains the rather surprising universal 
result for the N — 1 case: for all continuous and symmetric jump distributions, the mean number of records up to 
step n, (Ru.n) = Sm=i r m.N is universal for all n and grows as ^An/n for large n (28[. The universality in this case 
can thus be traced back to Sparre Andersen theorem. 
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In contrast, for TV > 1, we need the full functions p m (x) and q m (x) to compute the record rate in Eq. (fTS"|) . This is 
hard to compute explicitly for all m. However, one can make progress in computing the asymptotic behavior of the 
record rate r m ^ for large m and large N, as we show below. In turns out that for large m, the integral in Eq. (fTS"|) 
is dominated by the asymptotic scaling behavior of the two functions p m {x) and q m (x) for large m and large x. To 
extract the scaling behavior of p m {x) and q m (x), our starting point would be the two equations (|22j) and (|23l) . The 
next step is to use these asymptotic expressions in the main formula in Eq. (|15j) to determine the record rate r m ^N 
at step m for large m and large N. The procedure to extract the asymptotics is somewhat subtle and algebraically 
cumbersome. To facilitate an easy reading of the paper, we relegate this algebraic procedure in the appendices. Here 
we just use the main results from these appendices and proceed to derive the results announced in Eqs. © and (fTOl) . 
The asymptotic behavior of p m (x) and q m (x) depend on whether a 2 = J^^r] 2 f(r]) dr] is finite or divergent. This 
gives rise to the two cases mentioned in Section II. 

Case I(cr 2 finite): In this case, we show in Appendix A that in the scaling limit x ~ > oo, m — > oo but keeping the 
ration x/s/rri fixed, p m (x) and q m (x) approach the following scaling behavior 

1 / x \ 2 i 

Pm{x)^-=—gi (- , , where g 1 (z) = -^ze~ z , (26) 

q m {x) ->hi[ X : ) , where h\{z)=&d{z), (27) 



\v2cr" mj 

where erf(z) = -i f* e~ u du. Note that dh\{z)/dz — gi(z)/ ' z. 

Case II {a 2 divergent): For the case when the Fourier transform of the jump distribution f(k) has the small k 
behavior as in Eq. ([9]), we show in Appendix B that in the scaling limit when x —¥ oo, m —¥ oo, but keeping the ratio 
x/m 1 ^ fixed, 

PM ~> ml/ l +1/ll g2 (^j) (28) 
q m {x) -> h 2 (-^) . (29) 

While it is hard to obtain explicitly the full scaling functions g 2 (z) and h,2(z) for all z, one can compute the large z 
asymptotic behavior and obtain 

92(z) ~ 4h> ( 3 °) 

z— ¥oo z + ^ 

h 2 (z) ~ 1-^ (31) 



where the two amplitudes are 



'ft — * i 1 

with the constant /3 M having different expressions for < \i < 1 and 1 < \i < 2 



A, = ^L(3» (32) 
B„ = (33) 



h = *ro=dJ TT^ du for 0< " <1 (34) 

2a M f°° u^ L 

^ = -F75 \ / ttt 2 ,2 du for 1 < M < 2 . (35) 

7rr(2-/i) 7 (l + i^)* 2 



The expressions above (|34[ I35[) can be written in a unified way for any < /i < 2 as 

aT(/i)sin(^) 

2r(l-/i) cos(if" 



= otvi HYT^n = , 2 - (36) 



7T 

where, in the last equality, we have used T(l — u)T(a) = . We recall that here we are considering discrete time 

sin imt 

random walks (JT|). In the continuous time random walk framework, with an exponential waiting time between jumps, 
the quantity g2(z) was studied in Ref. 51|. By performing an asymptotic analysis similar to the one presented in 
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Appendix B, the authors showed that 52(2) behaves, for large z, like in Eq. (|30|) with the same exponent albeit with 
a different amplitude. On the other hand, the exact asymptotic result (|3TJ1) . together with Eq. (1551) can also be used 
to study the normalized pdf p m (x) of the position after m steps, with the condition that the walker stays positive at 
all intermediate steps, which was recently studied in Ref. [iol ]. It reads 

Pm{x) = f/plt)dx ^ h (^) ' UZ) = ^ 92{Z) ' (3?) 

where we have used the Sparre- Andersen theorem p m (x) dx = q m (0) ~ 1/y/nm for large m. From Eq. (|30p . one 
obtains the large z behavior of g-ziz) as 

hM „ 4,^- '--m r » +1 » , (38 ) 

where we have used fJ^(fi) = + 1). On the other hand, if one considers the probability density function P m {x) 
of the position of a free Levy random walk after m steps, with a jump distribution as in Eq. @ after m steps, it 
assumes the scaling form, valid for large m, P m (x) ~ mT 1 ' ^pix J m 1 ' ^ where the asymptotic behavior is given by 

z— ►oo 2 7T 

Therefore the above result (f3"5f establishes that — 2C p : this result was recently obtained analytically in pertur- 
bation theory for \i close to 2, 2 — fi -c 1, and conjectured to hold for any /x, on the basis of thorough numerical 
simulations [401 ] . Here this result is established exactly for any /i g (0,2). While the large z behavior of 32(2) is 
the most relevant one for our study, we mention, for completeness, that its small z behavior was also studied in Ref. 
[EHHHi yielding g^iz) ~ z^/ 2 . Finally we remark that the asymptotic behavior of h,2{z) for large z has been computed 
in great detail recently in Ref. [3!| , only the first two leading terms are presented in Eq. (131]) here. 

We are now ready to use these asymptotic behavior of p m {x) and q m {x) in Eq. (|15l) to deduce the large m behavior 
of the record rate. Noting that for large m, the integral is dominated by the scaling regime, we substitute in Eq. ([15)) 
the scaling forms of p m {x) and q m (x) found in Eqs. (f2l)|) . (|2"T1) . (|2"5|) and (|2"9"|) . We then get, for large m, 



N 



ex.: 



r mjJV «— / ^[/i^)]^- 1 ^, (40) 







where g(z) = gi^(z) and h(z) = hi^(z) depending on the two cases. So, we notice that in all cases the record rate 
decreases as mT 1 / 2 for large m, albeit with different A-dependent prefactors in the two cases. Hence, the mean 
number of records (R n ,N) up to step n grows, for large n, as 



POO 

(Rn,N) ~ oiN V™ > where chn — 2A / g(z) [h(z)] 

Jo 



N-l 



dz . (41) 



Next we estimate the constant «at for large A. We first note that a at in Eq. (l4"TT) can be expressed as 

9( z ) d rn/.MWi 



a " = 2 J o f{t)T z {Hz)] }dz > (42) 

where h'(z) = dh/dz. Noticing that h(z) is an increasing function of z approaching 1 as z — > 00, the dominant 
contribution to the integral for large A comes from the large z regime. Hence, we need to estimate how the ratio 
g(z)/h'(z) behaves for large z. Let us consider the two cases separately. 



Case I (a 2 finite): In this case, we have explicit expressions for gi(z) and hi(z) respectively in Eqs. ([26)) and ([2T 
Hence we get 



r°° d 
2 / dzz-^-[evi(z)] N 
Jo dz 



a N = 2 I dzz— [erf(^)] JV (43) 

j 

dyy^-[exi{y/2)] N . (44) 
dy 







The rhs of Eq. (|44l) has a nice interpretation. Consider A i.i.d positive random variables {j/i, y%, • • • , 2/jv}, each drawn 
from the distribution: p(y) = -j= e~ v / 4 for y > and p(y) — for y < 0. Let Mjv denote their maximum. Then the 
cdf of the maximum is given by 



Prob[A/jv <y] = 



P(y') dy' 







N 

N 



= [er%/2r • (45) 
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The probability density of the maximum is then given by: -^[evi(y / 2] N . Hence, the rhs of Eq. (|44| is just the average 
value (A/jv) of the maximum. This gives us an identity for all N 

a N = (M N ) . (46) 

From the standard extreme value analysis of i.i.d variables [Hj], it is easy to show that to leading order for large N, 
(Mn) ~ 2vha N which then gives, via Eq. (|41|) . the leading asymptotic behavior of the mean record number 

(Ru.n) 2VhdVV^ ■ (47) 

TV— >-oo 



Case II (<r 2 divergent): To evaluate a at in Eq. (J42j , we note that when a 2 is divergent, unlike in Case I, we do 
not have the full explicit form of the scaling functions g-iiz) and hiiz). Hence evaluation of for all N is difficult. 
However, we can make progress for large N. As mentioned before, for large N, the dominant contribution to the 
integral in Eq. (|42p comes from large z. For large z, using the asymptotic expressions in Eqs. (|5P|) and (|31l) . we get 

9i{z) z^oo A,, _2_ 
h' 2 {z) f iB^ 0F' 

where we have used Eqs. (f52"|) and for the expressions of and B^. We next substitute this asymptotic constant 
for the ratio gi{z)jh! 2 {z) in the integral on the rhs of Eq. (|42l) . The integral can then be performed trivially and we 
get, for large N, 



a N > —=. (49) 



4 



la 



From Eq. (|4ip we then get for the mean record number 



(R ntN )J^ -$=yfa. ( 5 °) 



In contrast to case I in Eq. (|4"T)) . here the mean record number becomes independent of N for large N. 



IV. THE DISTRIBUTION OF THE NUMBER OF RECORDS FOR FINITE a 2 



In the previous section, we performed a very precise study of the mean number of records (R n ,N) up to step n, 
in both cases where a 2 is finite and divergent. In the present section, we investigate the full probability distribution 
function (pdf) of the record number R n ,N- However, we have been able to make analytical progress for the record 
number distribution only in case I where a 2 is finite to which we restrict ourselves below. 

The clue that leads to an analytical computation of the record number distribution is actually already contained 
in the exact expression of the mean record number in Eqs. (|41l) and (|44|) . This result suggests that there perhaps is 
a relation between the record number Rn,N and the stochastic variable Y n ^M defined as 

Y n n — max a; max (m) = max max \xi(m)]. (51) 

0<m<n 0<m<nO<i<N 

Note that Y n ^ simply denotes the maximum position of all the walkers up to step n. In this section, we will see that 
for case I where a 2 is finite, indeed there is a close relation between the two random variables R n ,N and Y n ^- 

To uncover this relation, it is actually instructive to consider first the case of N independent lattice random walks 
defined by Eq. © where the noise ^j(m)'s are i.i.d. random variables with a distribution f{rj) — 5^(77 — 1) + hS(r] + l). 
Consider now the time evolution of the two random processes R n ,N and J^.jv- At the next time step (n + 1), if a new 
site on the positive axis is visited by any of the walkers for the first time, the process Y n ^ increases by 1, otherwise 
its value remains unchanged. Whenever this event happens, i.e., a new site on the positive side is visited for the first 
time, one also has a record event, i.e., the process R n ,N also increases by 1. Otherwise R n ,N remains unchanged. 
Thus, the two random processes Y n ^ and R n .N are completely locked with each other at all steps: whenever one of 
them increases by unity at a given step the other does the same simultaneously and when one of them does not change, 
the other also remains unchanged. In other words, for every realization, we have, 3^,+i,jv — Y n> N = R n +\.N — Rn,N- 
Now, initially all walkers start at the origin indicating Yo,jv = while i?o,JV = 1 since the initial point is counted as a 
record by convention. This allows us to write the following identity for all n and N 

Rn n =Y n n + 1 = max max \xAra)] + 1 . (52) 

0<m<nO<i<Af 
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We can now take advantage of this identity to compute the probability P(i? ni yv = M, n) as the distribution of Y n> N, 
i.e., the maximum of N independent lattice walkers up to n steps can be computed using the standard method of 
images. One obtains, after some manipulations left in Appendix [C| for < M < N + 1 



P{R n M = M, n) = 




N 



n 

k-M 



1 



L 2 



2" 



N 



E 

fc=0 



N 



n 

k-M- 



where [x\ is the largest integer not greater than x. For instance, for N = 1 one gets from Eq. ([53 

n 



If n 

P(R 7hN = M, n) = —[ r „ +M _ r 



(53) 



(54) 



where \x~\ is the smallest integer not less than x. We have checked that this formula for N = 1 ([54")) yields back 
the result for the first moment (R n .i) a s obtained in Ref. (28|. From the above formula ([5"3")l one can also compute 
(Ru,n), for instance with Mathematica, although obtaining a simple closed form formula for it for N > 1 seems rather 
difficult. In Table U we have reported the first few values of (Rn,N) for N = 1 to N = 4. 



N = 1 



N = 2 



N = 3 



N = 4 



n = 



n=l 



3 
2 


= 1.5 


7 
4 


7 
4 


= 1.75 


35 
16 


15 

8 


= 1.875 


157 
64 


31 _ 
16 " 


= 1.937... 


671 
256 



n =2 



= 1.75 



= 2.453. 



= 2.621. 



n = 3 



1 = 2.53!. 



731 
256 

6303 
2048 



= 2.855. 



= 3.077.. 



n = 4 



35 _ 






2.187... 


16 ~ 




723 _ 






2.824... 


256 " 




13145 


= 3.209. 




4096 


227343 


= 3.468 


65536 





Table I. First values of (R n ,N) obtained from Eq. (|53[) . 



Using the identity (|52j) . one can also obtain the large n behavior of R n ,N- Indeed, in this limit, each rescaled ordinary 
random walk Xi(rn)/y/n converges, when n — > oo, to a Brownian motion B D= i i (r) with a diffusion coefficient 
D = 1/2, on the unit time interval, r £ [0, 1]. Therefore from the above identity (|52l) one gets, in the limit n —> oo 



N 



M. 



N 



max max 

Ki<N 0<t<1 



D 



D= 



i(r) 



(55) 



Now, the distribution of max < T <i B D= i i (r), i.e., the maximum of a single Brownian motion (with diffusion constant 
D = 1/2) over a unit interval is well known (see e.g., in (2l| ) 



Proba. [Mi < m] 



exp 



x 



dx = erf 



m 

71 



(56) 



Eq. ([53]) demonstrates that in this case, R u .n I ' \fn is distributed like the maximum of a collection N i.i.d positive 

random variables {zi,Z2, ■ . ■ ,Zn}, each drawn from the distribution: p(z) = yj~^e~ z I 2 for z > and p(z) — for 

z < 0. From Eq. (|S"6"|) one obtains also that (i?i, n ) 
method. More generally for any N one has 



y/2n/ir, for n ^ 1, as obtained in Ref. [28], using a different 



lim Proba. 



Rn. 



N 



< m 



/ - \ N 

= Proba. [Mi < to] = 



(57) 



Having discussed the lattice random walk, let us now return to the case where the jump distribution is continuous 
in space but with a finite a 2 . In this case, we do not have an identity between R n ^N and Y n ^ analogous to Eq. (|52[) 
for lattice random walks. Nevertheless, we conjecture below and later provide numerical evidence in section VA, that 
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for large n, the scaled random variable R n ,N / \fn converges, in distribution, to the scaled variable Y n ^jyfn up to a 
prefactor a/y/2, i.e., 



lim 



R,, 



a/2 Y n N 
nm =- 



n— >oo -v/ 71 n— foo (J 



(58) 



where = indicates that the random variables on both sides of Eq. (|58|) have the same probability distribution. On the 
other hand, using central limit theorem, it is easy to see that the position of each rescaled walker ^ Xt ^j^ > converges 
in distribution, as n — > oo, to a continuous-time Brownian motion Bd = i^(t) with a diffusion coefficient D = 1, over 
the unit interval r <E [0, 1]. Thus the conjecture in Eq. (|58|) is equivalent to 



lim 



M 



N 



max max [£?r> = 

Ki<N 0<t<1 



i(r)] • 



(59) 



The argument leading to this conjecture in Eq. (J59J) can be framed as follows. Consider first the case for N = 1. In 
this case, the full distribution of R n ,i was computed in Ref. [28| for all n and in particular, for large n, it was shown 
that [H that 



lim 

n— >oo 



R), 



Mi 



max [£>n = 

0<T<1 



l(t)] 



(60) 



where Bd=i(t) is a Brownian motion with diffusion coefficient Z? = 1 on the unit time interval starting from Bd=i{0). 
This result (1601) . for continuous jump distribution, is very similar to the one obtained for a lattice random walk in 
Eq. (|55|) for N = 1, except that the diffusion coefficient of the Brownian motion involved in the discrete case is 
D = 1/2 while it is D = 1, independently of a 2 , for continuous jump distributions. In particular, from Eq. (|60p one 
obtains 



Proba. ( j} < x 



-> erf 



(1) 



(61) 



Hence, at least for N = 1, we know that for large n, Rn,i/s/n for the continuous jump distribution in Eq. (|60[) 
behaves in a statistically similar way as that for lattice walks, the only difference is that the effective diffusion constant 
of the underlying Brownian motion is D — 1 in the continuous case (Eq. (|5"9")l ). while it is D = 1/2 for the lattice case 
(Eq. (|55pV Based on this exact relation for N — 1, it is then natural to presume that this asymptotic equality in 
law between record numbers for continuous jump process and lattice walks holds even for N > 1, thus leading to the 
conjecture in Eq. (|59|) . As a first check of the validity of this conjecture, we note that the result for the first moment 
of the record number in Eqs. pTj) and (jlB")) is fully consistent with the conjecture in Eq. (|59p. 

As announced in the introduction ([8]). the conjecture in Eq. (|59j) is equivalent to say that R n ,N / ' \fn converges, in 
distribution, to the maximum of N independent random variables Mjy = max(yi, j/2, • ■ • > Un), where yt > 0's are i.i.d 
non-negative random variables each drawn from distribution p(y) = -7= e~ y / 4 for y > and p(y) = for y < 0. In 
particular, the cdf of R n ,N / ' sfn is given by 



. R n jy 
Prob. — < x 



rf 



A'* 



(62) 



In section VB we will demonstrate that this conjecture is well supported by our numerical simulations. From Eq. 
(I5T)1) , one can then use standard results for the extreme statistics of independent random variables 53] to obtain that 
for large and large n, the scaled random variable R n ,N /\/n (properly shifted and scaled) converges, in distribution, 
to the Gumbel distribution as announced in Eq. ([?]>. 

Although we have not found a rigorous proof of the above result (|59p . the fact that both formulae for random 
walk with discrete (155]) and continuous (I5T)1) jump distribution differ essentially by a factor of \/2 is reminiscent of a 
similar difference, by the same factor of for the survival probability q n {0) corresponding to both random walks 
(starting from the origin). This quantity g n (0) plays indeed a crucial role in the computation of the record statistics 
of a random walk 28]. For the lattice random walk, one has indeed q n (0) ~ \f2/nn~ 1 l 2 while for the continuous 
random walk one has, from Sparre- Andersen's theorem, q n (0) ~ yjl/nn^ 1 / 2 , independently of a 2 . This fact certainly 
deserves further investigations. 

In the case of divergent a 2 , the two random variables R n ^M and Y n ^ do not seem to be related in any simple way. 
This is already evident from the result for the mean record number (i? n ,jv) in Eq. ([5U)) for < /1 < 2. As n —> 00, 
(Ru,n) ~ 4/-y/7r y/n where the prefactor does not depend on N for large N. In contrast, using standard extreme value 
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statistics 53|, it is easy to show that the mean value (Y n ^) ~ (N n) 1 ^ for large n and N with < fi < 2. This 
rather different asymptotic behavior of the mean thus already rules out any relationship between R„ t N and Y n n for 
case II. So, for this case, our result for the distribution of R n ,N is only restricted to numerical simulations that are 
presented in the next section. 



V. NUMERICAL SIMULATIONS 



In this section we present the results of our numerical simulations of N independent random walks both with a finite 
a 2 (case I) and with a divergent a 2 (case II) and compare them with our analytical results. In the first subsection 
IV Al we study the statistics of the record numbers (both its mean value and its full distribution). Since, at least in 
case I, the mean record number is strongly related to the expected maximum of the process we will then analyze the 
evolution of the largest of the N random walkers. This will be done in section IV Bl We find that the statistics of 
the maximum significantly differs between the cases I and II. Finally, in section IV CI we will consider the correlations 
between individual record events in the two different cases and show that, at least asymptotically, these correlations 
are not different from the case of only one single random walker. 



A. Statistics of the record numbers 



Case I (cr 2 finite). In Fig. ©, we show our numerical results for (i? n ,iv) for a 2 finite, which were obtained by 
a direct simulation of the jump process in Eq. ([1]) with n — 10 4 steps, with a Gaussian distribution of the jump 
variables r)i(m)'s (mean and a 2 = 1). These results have been obtained by averaging over 10 3 different realizations 
of the random walks. These data, on Fig. © are indexed by the label 'Gaussian'. Our numerical data show a very 
nice agreement with our analytical result obtained in Eq. (|4ip yielding {R u ,n) / \fn = &n- The large N behavior of 
ajv can be easily obtained by a saddle point analysis, yielding: 



(Rn,N) 



2vTogiV 



log(logTV) 



■0[(logJV) 



-l/2i 



It turns out that for N 



2VlbgiV 

1000, the sub-leading corrections (|63l) are still sizeable. 



(63) 



o 
o 
o 



cr 

V 
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Figure 3. Rescaled mean record number (R n ,N) / \fn for a fixed n — 1000 plotted against the number of random walkers N. 
We performed simulations with jump distributions of the type / (rj) ~ an d different fj, = 1, 1.5, 1.8 and 1.95 and for the 

Gaussian jump distribution with zero mean and a 2 = 1. The Gaussian case is compared to our analytical finding for the finite 
a 2 case (Eq. I)47p h which is given by the dashed line. The thick black line gives the analytical result for the infinite a 2 regime 
(Eq. (|50|l h With increasing iV all (Ru.n) j\fn with /i < 2 approach this line of value A/yfii. 



We have also computed numerically the distribution of the (scaled) record numbers R n ,N / \fn and compared it to 
our conjecture in Eq. (JSHJ). The results of this comparison, for different values of N — 2, 4 are shown in Fig. 2] ,where 
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the data were obtained by averaging over 5 x 10 4 realizations of independent random walks of n = 10 4 steps. The 
data, for two different continuous jump distributions (exponential and uniform) show a very nice agreement with our 
analytical prediction in Eq. (|62[) . which we argue to be an exact result. As mentioned above, one expects that this 
distribution will converge, for N — > oo to a Gumbel distribution ([7]), albeit with strong finite N effects. 




Figure 4. Scaled probability distribution function \fnP{R n ^M , n) as a function of R n ,N /\fn for N = 2 and N = 4 independent 
of random walks of length n = 10 4 . For each value of N, we show the result for the case where the jumps are distributed 
exponentially ('exp') and uniformly between —1/2 and 1/2 ('uni'). The dotted lines correspond to the result in Eq. ((62} which 
we conjectured to be the exact one. There are no fitting parameters. 

Case II {a 2 divergent). In Fig. ([3]), we show our numerical results for {R u ,n) for a 2 divergent, obtained by 
a direct simulation of the random walk (JXJ) where the distribution of rji(m)'s has a power law tail f{rj) ~ 
with fj, < 2, and different values of fx. The data presented there were also obtained by averaging over 10 3 different 
realizations of random walks, with 10 4 steps. These data show that, in this case, (R n n) / \fn approaches a constant 
value for fixed (and large) n and N — > oo, which is fully consistent with the value of 4/y / 7r obtained analytically in 
Eq. (|50p. The simulations also show how the speed of this convergence is modified when fi < 2 is varied. While for 
small (i <C 2, (R n ,N) I V™ approaches the universal limit value of A/^/tt very quickly, we find a slower convergence for 
2-/K1. 

The numerical computation of the distribution of the (scaled) record number Rn,N/Vn is of special interest because 
an analytical study of it, beyond the first moment, is still lacking. The two plots on Fig. ((5|) show our numerical 
results for this distribution, which were obtained by averaging over 10 4 independent random walks of length n = 10 4 . 
The left panel in Fig. ([5]) shows the rescaled distribution of R n ,N / \fn at a fixed time step n = 10 3 and ii = 1. 
Apparently all curves for N = 10, 10 2 , 10 3 and 10 4 collapse on one line. In the inset of the left figure we kept n = 10 3 
and N — 10 3 fixed and varied ii: one can see that all the cdf's collapse. These results suggest that 



Prob. 



Rn.N , 
7=r- < X 



N- 



+ F 2 {x) , (64) 



where the limiting distribution function F 2 (x) is independent of /i < 2. We tried to guess the analytic form of F% (x) 
by comparing with several known continuous distributions that are defined for positive real numbers. We are certain 
that f 2 (x) is not a Gaussian distribution. By far the best results were obtained by fitting with a Weibull distribution 

F 2 (x) = l- exp (- (Xx) k ^j , (65) 

with two free real parameters A > 0, k > 0. Fitting with the least-squares method gives values of A w 0.8944 ± 0.0003 
and k = 2.558 ± 0.003. The right panel in Fig. ((SJ compares this fit with the distribution obtained from a simulation 
of N = 10 4 random walks of length n = 10 4 . While the agreement, both for the cdf and the probability density 
function (pdf) is quite good, there are still some deviations between the two, particularly for small values close to 
zero. We are not sure, if this difference is a finite N effect or if the real limit distribution of R n ,N / 'y/n for N — >• oo 
effectively differs from a Weibull distribution. 
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Figure 5. Left: Cumulative distribution function (cdf) of the scaled variable Rn,N/\/n for the Levy index fj, = 1, which 
approaches the universal distribution F% (x). The figure gives results for a fixed n = 10 3 and different TV, for each TV we 
performed 10 5 simulations. The inset gives simulations of the cdf for fixed TV = 10 3 while Levy index is varied. Right: The 
cdf for /1 = 1 and TV = 10 4 . We have fitted the data with a Weibull distribution as in Eq. (|65[). where the fitting parameters 
were A w 0.8944 ± 0.0003 and k — 2.558 ± 0.003. The inset gives the corresponding curves for the pdf 's. 



B. Temporal evolution of two stochastic processes: the record number R u .n and the global maximum V nj jv 

up to step n 

In the case of jump distributions with a finite second moment a 2 (case I), we have shown that the mean (Y^jv) of 
the maximum of all walkers up to step n and the mean record number (R ni pj} are proportional to each other in the 
limit n — > 00, both grow with \Jn In TV. In contrast, for Levy walks with index \i < 2 (case II) the relation between 
these two observables does not hold any more and the mean record number grows much slower than the maximum 
(see the discussion at the end of section IV). To illustrate the similarities and differences in the growth rate of R n ,N 
and Y n ^N in the two cases (I and II), we compare their respective time evolution for 4 different samples in Fig. (j^J. 
On the left panel, we consider the Gaussian jump distribution with zero mean and a 2 — I and we see that the process 
R u ,n and (y/2/a) Y n ^ become identical very quickly. Moreover, the two processes evolve almost in a deterministic 
fashion with growing n and hardly fluctuate from one sample to another. In contrast, on the right panel where we plot 
the two processes for fi = 1, their behavior change drastically. First of all, the two processes R n> N and Y n ^ do not 
seem to have relation to each other. While R n _N again evolves almost deterministically and in a self-averaging way, 
the trajectories of the process Y n> N differ strongly from one sample to another and Y n> N is clearly non self-averaging. 
In particular, the process Y n ^ can, like in a single Levy flight, perform very large jumps exceeding its previous value 
by several orders of magnitude. 



C. Correlations between record events 

An important feature of the record statistics of a single random walk (TV = 1) is its renewal property, which leads 
to the fact, that each time after a record event, the record statistics is, in some sense, reseted. After a record event 
the process evolves as a new process with the record value as its new origin. Therefore it is very simple to give the 
pairwisc correlations between record events. In fact, from the above argument, we have 

Prob [rec. at n — k and n) — Prob [rec. at n — k] x Prob [rec. at k] — r n -k,\rk,i ■ (66) 

Using the results from [28} this gives the following (exact) result for Prob [rec. at n — k and n]; 

Prob [rec. at n - k and n] = ( 2 ) ( ^ ) 2 " 2 " • ( 67 ) 

In the special case of k = 1 we find Prob [rec. at n — 1 and n] = |r n _i. With this we find that the conditional 
probability of a second record directly following a record that just occurred is always given by 

Prob [rec. at n|rec. at n — 1] = — . (68) 
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Figure 6. Left: Time evolution of the record number R n ,N and the rescaled maximum value (\/2/a) F„,jv reached up to the 
n-th step for four different realizations of N — 1000 random walks (labeled A,B,C,D) with Gaussian jump distribution (zero 
mean and a = 1) (case I). Here, the results look rather deterministic and for n —¥ oo, we find (V2/a) Y„,jv ~ R n ,N for every 
realization. Right: R n ,N and y„,jv for four different realizations of N = 1000 Levy flights (labeled A,B,C,D) with fj, = 1. The 
behavior of Y„ : n for the Levy flight is completely different from R„,n'- while R„,n is self-averaging, F n ,jv fluctuates widely 
from one sample to another and is not self-averaging. 




Figure 7. Probability for two successive records at times n — 1 and n for a single random walker as well as iV random walks 
with jump distributions of tail- exponents fi — 2 (case 1) and n = 1 (case 2). In all three cases this probability asymptotically 
approaches a value of 1/2 times the probability for a record in the nth step. Therefore, for large n, the correlations between 
the record events in the N 3> 1 regime are the same as in the N = 1 case. This is also shown by the inset, where we plotted the 
probability for a record in the nth step conditioned on a record in the previous one, which approaches 1/2 in all three cases. 



In our efforts to understand and compute the distribution of the record number R n .N for Levy flights (case II), 
we considered the correlations between successive record events also for N 3> 1 random walks. If the correlations 
between successive record events in the large TV limit would vanish, one could assume that the asymptotic distribution 
of R n ,N approaches a Gaussian. However, we found that this is not the case. Fig. ([7]) gives the behavior of 
Prob [rec. atn-fc and n] for the N = 1 case, as well as for N = 10 3 for random walks of the two cases I and II with 
Levy indices \i — 2 and fi = 1. In all three cases Prob [rec. at n — 1 and n] approaches ^Prob [rec. at n — 1] = \r n ^ 
for large n, proving that for n — > oo the probability for a second record after an occurred one is just 1/2. The inset of 
Fig. ([7]) also shows this behavior. Here, while for TV = 1 the conditional probability for a second record is always 1/2, 
this value is only approached for larger n in the case of N 3> 1. For small n the conditional probability is larger. 
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VI. COMPARISON TO STOCK PRICES 

The oldest application of the random walk model, which was already proposed by Le Bachelier [45] in 1900, is the 
one to stock data 0, |47} . In his model the stock prices perform a so-called geometric random walk and trends in 
the stocks are modeled by a linear drift in the logarithms of the stock prices. In [3(| the record statistics of stocks 
in the Standard and Poors 500 |33j (S&P 500) index were compared to the records in a random walk with a drift. 
The authors could show that on average, on a time interval of n = 100 trading days, the statistics of upper records 
in individually detrended stocks are in good agreement with the same statistics of a random walk with a symmetric 
jump distribution. The lower records however showed a significant deviation from this model. 

Here, we want to extend this analysis to the record statistics of A stocks. The question is, to what degree, the 
record statistics of A normalized and randomly chosen stocks from the S&P 500 can be compared to the record 
statistics of A independent random walks. As in [3(|, the observational data we used consisted of 366 stocks that 
remained in the S&P 500 index for the entire time-span from January 1990 to March 2009. Overall, we had data for 
5000 consecutive trading days for each stock at our disposal. In [3(| we found that it is useful to analyze this data 
over smaller intervals, on which one can then detrend the measurements. We decided to split up the 5000 trading days 
into 20 consecutive intervals of each 250 trading days, which is roughly one calendar year. In each of these intervals 
we considered the logarithms Xi = In Si/ So of the stock prices Si, where So is the first trading day. The random 
walk model then suggest that these logarithms Xi perform a biased random walk that starts at the origin (Xq = 0) . 
Since our analytical theory presented in this paper only works for symmetric random walks we had to detrend the 
stocks. We subtracted an index-averaged linear trend obtained by linear regression from the AYs in order to obtain 
symmetric random walkers. Finally, in order to make the stocks comparable to our model of A random walkers of the 
same jump distribution, we had to normalize the JQ's by dividing through the standard deviations of the respective 
individual jump distributions. After this detrending and normalization we can assume that the jump distributions in 
the individual time series have at least the same mean and the same variance, which should then be given by and 1. 

For a fixed A, we randomly selected subsets of size A from the 366 detrended and normalized stocks for each of 
the 20 intervals of length n = 250 and computed the evolution of the record number R n ^M in these subsets. To get 
reliable statistics we average R n ,N over 10 4 different subsets with a fixed A and also averaged over the 20 consecutive 
intervals. The resulting (R n ,N)'s for the upper and the lower record number and A between 1 and 100 are given 
in Fig. ||HJ|. We find that both the curve for the upper and the curve for the lower mean record number are not in 
agreement with our theoretical prediction for the case of A Gaussian random walks given by (Rn,N) — ZVn In A. 
However, Fig. © shows, that the (R n ,N) for the stocks increase with N. We also considered subsets of size A > 100 
and found that for A closer to the maximal value of 366, (Ru.n) gets almost constant in A. While the increase of 
(Rti,n) for smaller N indicates that the statistics behave like A independent Gaussian random walks, the fact that it 
saturates for large A could indicate that they behave like a Levy flight with Levy index fi < 2. We know however, that 
the tail exponent of the daily returns In Si/ Si— % in the stock data is much larger than /i = 2 and that they definitely 
do not perform a Levy flight [48h501 |. Much more likely is that the correlations between the individual stocks play 
an important role. In addition, we observed that at least for N < 100, (R n ,N) grows proportional to 2y/n1nN. One 
way to interpret this finding is the following: When we assume that in A stocks only a smaller number of A 7 (with 
7 e R + and 7 < 1) is effectively independent and that only these A 7 stocks contribute to the record statistics, the 
mean record number should be given by 

(Rn,N) = (R n , A 7 )( Ga — ) = 2^7™ In N , (69) 

and saturate if the value of -/V 7 ^ is achieved, where N max is the total number of stocks. In Fig. ((SJ we fitted curves 
of the form ^/j± 2-s/nln A with ^/7+ ~ 0.655 and m 0.605 to the development of the upper and lower (i? ni jv)- 
The good agreement with the fitted curves and the data confirms our assumption. Apparently, the record statistics of 
A detrended and normalized stocks is the same as the one of A 7 independent Gaussian random walks. This finding 
is also confirmed by the inset in Fig. ((8]). There we plotted (i? n ,Af)/v / ln A for different interval length n and some 
different subset sizes A. The fact that all the lines collapse tells us that (i?„ i Ar)/vmA is independent of A and 
therefore 



{Ru,n) oc Vn In A. 



(70) 
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Figure 8. The averaged upper and lower record number after n = 250 trading days in the S&P 500 stock data. The 5000 
trading days in [33j were subdivided in 20 intervals of 250 days and then linearly detrended in these intervals using the average 
linear trend of the index. Then we chose N stocks randomly out of the total number of 366 stocks and analyzed the evolution 
of the record number in this set. This random picking was repeated 10 4 times and the results were averaged to obtain the 
figure. The dashed lines give our analytical prediction for TV Gaussian random walks multiplied by fitted prefactors. The inset 
gives the behavior of the (R n ,N) / Vhi TV for different N plotted against the interval length n, confirming the proportionality 
(R n , N ) oc VlnN. 



VII. CONCLUSION 



In conclusion we have presented a thorough analysis of the record statistics of N independent random walkers with 
continuous and symmetric jump distributions. For N > 1, we have found two distinct cases: the case where the 
variance of the jump distribution a 1 is finite and the case where a 2 does not exist (case II) as in the case of Levy 
random walkers with index < fi < 2. In the first case we have found that the mean record number behaves like 
(Ru.n) ~ 2y/log Ny/n for n, N > 1 while in case II, (Rn,N) ~ v^AV" for n, 2V » 1. 

We have then argued that, in the first case, the full distribution of the scaled number of records R n ,N /y/n is given by 
the distribution of the maximum of N independent Brownian motions with diffusion coefficient D = 1. This statement 
was suggested by an exact result for lattice random walks and it was corroborated (i) by our exact calculation of 
the first moment (R n ,N) / y/n valid for any value of N and (ii) by our numerical simulations. Of course it would be 
very interesting to obtain a proof of this result. From this connection with extreme value statistics, one thus expects 
that the distribution of R ni N / ' yfn converges, for /V — > oo, to a Gumbel form ([7]). This connection between record 
statistics and extreme value statistics could also be useful to compute the record statistics in other models discussed 
in the introduction, for instance, in the Linear Drift Model [T3, H3|- In the case of Levy random walkers, we have 
shown numerically that the full distribution of (R n ,N) / y/n converges, when N — > oo, to a limiting distribution ^2(2;) 
which is independent of /1. The exact computation of this universal distribution remains a challenging problem. Other 
interesting questions concern the extension of these results to include a linear drift [29| or to the case of constrained 
Levy walks, like Levy bridges which were recently studied in the context of real space condensation phenomena [43| . 
Finally the applications of our results to the record statistics of stock prices from the Standard & Poors 500 index 
suggest that, among a set of N stocks, only a smaller number, which scales like N 7 , with < 7 < 1, are effectively 
independent. The record statistics of these N 7 stocks is then very similar to the statistics of TV 7 independent random 
walkers. This idea might be useful for future investigations of the fluctuations of such ensemble of stock prices. 
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Appendix A: Scaling behavior of p m (x) and q m (x) for finite a 2 

We start from Eqs. (f22j) and ([23]) . When a 2 is finite, by central limit theorem, the typical position of a walker after 
m steps scales as m 1 / 2 for large m. Hence the natural scaling variable is z = xjm 1 ! 2 . Consider first Eq. (|2"2"j) satisfied 
by p m (x). To extract the leading scaling function in the scaling limit x — > oo, m — > oo but keeping z — x/m 1 / 2 fixed, 
we need to investigate <fi(s, A), given explicitly in Eq. (|21[) . in the limit when A — > 0, s —¥ 1 but keeping the ratio 
X/yl — s fixed. To extract the behavior of </>(s, A) in this scaling limit, it is advantageous to work with an alternative 
expression of <j>{s, A) derived in Ref. 24] for finite a 2 



( S ,A) = 



yT^s + cr A v^s/2] 



exp 



A r°° 

7T io 



A 2 + fc 5 



In 



1 - 

1 -s + ser 2 £; 2 /2 



(Al) 



This expression is more suitable for extracting the scaling limit. In the limit A — > and s — > 1, the expression inside 
the exponential in Eq. (|A1|) tends to and hence, to leading order, we have 



(s,\) 



(A2) 



[VT~s + aXy/s/2] 

Inverting the Laplace transform with respect to A (it has a simple pole at A = —y/2(l — s)/a) one gets from Eq. ([22 

V2 _ 



(A3) 



Setting s — 1 — p with p — > in the scaling limit, the sum on the lhs of Eq. (|A3|) can be approximated, to leading 
order, by a continuous integral: X)m=o sm Pm( x ) ~ J Pm{x) e~ pm dm and we have 



Jo cr 



(A4) 



Next we need to invert the Laplace transform with respect to p. We use the explicit inversion formula, 
exp[— 6 2 /4m]. Applying this to Eq. (|A4[) gives, to leading order, in the scaling limit 



- b Vp] 



2 v^rr 



Pm(x) 



0\/7T 771 



3/2 



exp 



2a 2 m 



which can be reorganized in the scaling form 

1 



Pm{x) 



'2a 2 777 



9l 



V20 2 



, where gi(z) = —=ze 



(A5) 



(A6) 



Next we consider q m {x) given in Eq. (|23|) . Following exactly the same procedure as in the case of p m (x) we find, 
in the scaling limit, 



/•oo 

/ q m (x)e-P m dm 
Jo P 



l r 



-^/2px/a 



(A7) 



Inverting the Laplace transform with respect to p upon using the explicit inversion formula, £ p \ m [e h ^ /p] 
erfc(6/v / 4777,), we get, to leading order in the scaling limit 



q m (x) sa 1 - erfc 



V20 2 



erf 



\j2a 2 777 



(A8) 



which proves the result in Eq. ((27] 



Appendix B: Scaling behavior of p m (x) and q m (x) for divergent a 2 

We consider jump distribution f(j]) such that its Fourier transform behaves, for small k, as / (k) ~ 1 — \ak\^ 
with < fi < 2. In this case, the position of the walker after m steps, grows as m 1 ^ for large m [23|. Hence the 
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natural scaling limit is x — > oo, to — > oo with the ratio x/m 1 ^ fixed. For p m (x), we expect a scaling form p m (x) w 
m -i/2-i//* 52 (a;/ TO i/A«). The power of to outside the scaling function is chosen to ensure that J °° p m (x)dx ~ to 1 ' 2 . 

This is needed since we know from Eq. (|18l) and the Sparre Andersen theorem in Eq. that p m (x)dx — 

9m(0) ~ 1/y/nm for large to. Similarly, for q m (x), we expect a scaling form q m {x) ~ ^(x/m 1 ^) in the scaling limit. 

To extract the leading scaling functions (72 (-z) and h 2 (z) respectively from Eqs. (|22l) and ([2"3"]l. we need to investigate 
the function (f>(s,X) in Eq. (l2~Tj) in the corresponding scaling limit A — >■ 0, s — > but keeping the ratio A/(l — s) 1 /' 1 
fixed. Fortunately, this was already done in Ref. [23| in a different context. Setting s — 1 — p with p — > 0, the leading 
behavior of 0(s, A) in the scaling limit is given by (see Eqs. (43)-(47) of Ref. (23[) 



(s, A) w — exp 



1 



In 



l + -(aA«)" 



(Bl) 



Let us first consider the function p m (x) in Eq. (f2"2"j). We substitute the anticipated scaling form p m {x) = 
m -i/2~i/ fi J m i/mu^ Qn £ ne j ng Q £ (|2"2"1) . As before, setting p = 1 — s, we can replace, in the scaling limit, the 
sum over to by a continuous integral over to 



00 

m=0 



p m (x)e Xx dx 



OO P OO 



dx dme- Xx - pm m- 1 ^- 1 ^ 1 gi{wrrC x > '"). 



(B2) 



We then make a change of variable x m 



-l/p 



z and pm = y to get 



00 



m— 



> 1 

p m (cc) e~ Xx dx fa — 



OO /'OO 



t tedy fla (2)l/- 1/2 e- (Ap ~ 1/ '* ) * l/ '''- 1 ' 



(B3) 



We next substitute Eq. (|B3|) on the lhs of Eq. ([22]) and Eq. (|BT|) on the rhs of Eq. Ip2|). Writing the scaled variable 
as Xp^ 1 ^ — w and comparing lhs with the rhs, we see that the 1/Vp cancels from both sides leaving us with 



dz g 2 (z) 



dyy 



exp 



da 



1 



In [1 + a^w^u^} 
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Similarly, by substituting the anticipated scaling form q m {x) = h^x/m 1 /^) on the lhs of Eq. ([23]) and doing exactly 
the same series of manipulations, we get 



dzh 2 (z) / dyy 



i 1 /) 1 e -v e r w v 



l/M, 1 , , 

= -J,H 



(B5) 



where J^(w) is defined in Eq. (]B4[) . 

For later purposes, it is further convenient to define a pair of Laplace transforms 



<J2 



(P) = f 
JO 



g 2 (z)e- pz dz 



h 2 (z)e- pz dz 



(B6) 
(B7) 



in terms of which Eqs. (]B4[) and (]B5[) read 



dyy- 1 / 2 e-y~g 2 (wy L/ n = M™) 
dyy^^e-yhiwy 1 ^) = - J M H 



(B8) 
(B9) 



The equations (]B4[) and (|B5[) determine, in principle, the two scalings functions #2(2) and h 2 (z) for all z. In 
practice, it is hard to invert these two equations to obtain g 2 (z) and h 2 (z) for all z. However, it is possible to extract 
the large z asymptotics of these two functions by analyzing the leading singular behavior of J M (w) in Eq. (]B4[) as 
w — > 0. Clearly, it follows from the definition of Ju(w) in Eq. (IB4[) that J M (0) = 1. We are, however, interested in the 
leading singular correction term in Ju{w) as w — > which, as it turns out, depends on whether < fx < 1, 1 < // < 2 
or /i = 1. Below, we consider the three cases separately. 
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1. The case < fi < 1 

We consider J^iw) in Eq. (|B4[) and compute the derivative JUw) as w — > 0. Simple computation shows that 

■/» ^ -A* V^" 1 ; where 6 M = ^ [°° (BIO) 

Note that the integral defining is convergent as u — » oo for < p < 1. Integrating over id and using J M (0) = 1 we 
get the leading correction term as w — > 

J»wl-6X + ... (Bll) 

where 6 M is given in Eq. (|B10|) . 

Substituting this small w behavior of J^(iu) on the rhs of Eq. (|B8[) . it follows that to match the powers of w on 
both sides, the Laplace transform gi (p) must have the following small p behavior 

32 (p) ~ -L--?= 6 pM. (B12) 

p-s-0 ^/7T V7T 

Using the classical Tauberian theorem (for a simple derivation see the appendix A. 2 of Ref. [44[), one immediately 
gets the following large z behavior of 32(2) 

2:— >CX> 2 H 

with the amplitude 

A ' = ^W^) = ^ where "» = F(T^) = SxT^y o TT<? (B14) 

Similarly, substituting the small w behavior of J^w) on the rhs of Eq. (|B9I) and matching powers of w on both 
sides, we get 

Hp) ~ --b^p^ 1 . (B15) 

p^O p 

Once again, using the Tauberian theorem of inversion, we get 

h 2 (z) ~ 1-% (B16) 



with the amplitude 



where /3 M is given in Eq. (|B14[) . 
Finally, note that the ratio 



z— >oo 



^ = r y , = ^ , (si?) 



r(i - M ) 



A„ 2 



is universal in the sense that it is independent of p G (0, 1) as well as on the scale factor a. 



(B18) 



2. The case 1 < y, < 2 



Unlike in the previous case, one finds that the first derivative of J^w) at w — is finite when /1 G [0,2] and is 
given by 
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Note that for 1 < fj, < 2, the integral in Eq. (|B19[) is convergent as z — > 0. Thus, as w —> 0, J^w) — > 1 — w. To 
obtain the leading non-analytic singular term, we need to compute the next term. By taking two derivatives with 
respect to w near w = and then re- integrating back, we find the following leading singular behavior of Jfi{w) near 
w = 

2a^ 

ttOT^IWo (i + « 2 ) 2 



J (1 (w)wl-a fl w + c fl w'' + ... where c p = — — / - ; - (B20) 



Substituting this small w behavior of J M (w) on the rhs of Eq. (|B8|) and matching powers of u> on both sides we get 



1 a M 2 

^Oy^ r(l/2+l//x) 



aa(p) ~„ — - W1 /o : i/ „, p + -= c M (B21) 



where and are given respectively in Eqs. (IB19[) and (IB20[) . Again, inverting via the Tauberian theorem (see 
Ref. Q), we get 

92(z) ~ 4t> (B22) 

with the amplitude 

2 u(u-l)c M 2/i „ „ 2a^ Z" 00 , ,„„„, 

Exactly in a similar way, we substitute the small w behavior of J^{w) on the rhs of Eq. (|B9[) . match powers of w 
on both sides and find that 

^) ~ n ~- rn?9/ ^ + c ^"' (B24) 

p^o p r(i + 2///) 

where a M and are defined in Eqs. (|B19[) and (|B20[) . Inverting via the Tauberian theorem gives the desired result 

h 2 (z) ~ 1-^ (B25) 

z— Voo 2^ 



with the amplitude 



where /3 M is given in Eq. (|B23I) . 

In this case, also we note that the ratio 



is universal and does not depend explicitly on 1 < /i < 2 and a. 

3. The case fi = 1 

In this case, from Eq. (IB4I) 



^ - 2 (B27) 



1 r 00 *7 

Ji(iy) = exp[-/i(w)] where h(w) = - - — — ? ln(l + a w u) . (B28) 



(i 



Let us first derive the leading singular behavior of I\{w) as w — > 0. Making a change of variable x — aw u in the 
integral we get 

/l(w) = v/ ^w ln(1+a:) - (B29) 
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Next, we divide the integration range [0, oo) into two parts [0, 1] and [l,oo) and write Ii(w) = Z\(w) + Z 2 (w). The 
second part Z 2 (w), i.e., the integral over [1, oo] is a completely analytic function as w — > 0. Thus the leading singular 
behavior of I\ (w) as w — > is contained only in the first part 

Zl (w) = — t _ ^ 2 ln(l + x) . (B30) 

In this integral, we can now safely expand ln(l + x) = x — x 2 / + x 3 /3 + . . . and perform the integral term by term. 
The leading singularity comes from the first term of this expansion 



Zi(w) « — / — ——dx = — In 

Jn a; + a z ur 7r 



- --tulnto (B31) 

w— >0 7T 



which indicates, from Eq. (IB28[) . that 



a 



Ji(w) ~ l + -mlnu). (B32) 



w-yO 



Substituting this small w behavior of J\ (w) on the rhs of Eq. (IB8|) and matching the leading behavior of w on both 
sides indicates that 

hip) ~ n ~i= + -i=-P^p. (B33) 

This indicates, using Tauberian inversion theorem (see Ref. |44j|). 

52(2) ~ ^ where Ai = -. (B34) 

Similarly, substituting the small w behavior of Jfj,(w) on the rhs of Eq. (|B9|) and matching leading behavior of w 
on both sides we get 

hip) - - + -lnp (B35) 
p-yo p 7r 

which, when inverted, provides the following large z behavior 

h 2 (z) - 1 where B x = - . (B36) 

2->oo Z 7T 

Finally, we notice that even for this marginal p — 1 case, the ratio A\/B\ = has the same value as in the other 

two cases, namely for < p, < 1 and 1 < p < 2. 

Let us remark that if one puts p = 1 in the expression of /3 M in Eq. (I35[) , we get /3\ = a/ir. Correspondingly 
A\ = 2a/ n 3 / 2 from Eq. (|32[) and B\ = a/n from Eq. (l33l) . we find that they are consistent respectively with A\ in 
(|B34|) and and B x in (|B36p . In other words, the final asymptotic results for g 2 (z) and h 2 (z) in the marginal case 
p, = 1 arc included in the range ^ € [1)2], even though the details for p — 1 are quite different, as it has logarithmic 
singularities. 



Appendix C: Distribution of the maximum of a lattice random walk 

In this appendix we consider N lattice random walks (RW) starting at Xi(0) — 0, for i = 1, 2, • • • , N and evolving 

as 

Xi{m) = Xi(m - 1) + rii(m) , (CI) 

where the noise rji(m)'s are i.i.d. random variables with a distribution f(rf) = \8{rj — 1) + \&{r] + 1). The aim is to 
show the result in Eq. (|53"|) . taking advantage of the relation (|52p. 

We first consider a single random walk, N = 1, and denote by W(j,n) the number of lattice RW starting at 
x±(0) = and ending in j after n steps. One has 

W(j,n) = { (t)^k = n + j,n + jeven, 
I , n + j odd . 
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To compute the cumulative distribution function (cdf) of the maximal displacement of N walkers we need to compute 
the number of walks, for a single walker N = 1, which stay strictly below a given value M. We thus denote, for N = 1, 
by Wm(Ji n) the number of walks which stays strictly below an integer M and end up in j after n steps. To do this 
we use the reflection principle, e. g. the method of images: WmO, n) can be obtained by subtracting to W(j,n) the 
number of free walks which start in x(0) = 2M and end in j after n steps. This yields: 



(fe) - L-m) , 2k = n + j , n + j 
, n + j odd . 



(C3) 



The total number of walks Wjv/(n) which start at xi(0) = and stay strictly below M after n steps are obtained by 
summing W(j,n) in Eq. (|C3j) over the endpoint j < M. This yields 



W M (n)= £ 
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k-M 



where [^J is the largest integer not greater than x. Therefore one has 
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We can now write the cdf of the maximal displacement of N independent walkers as 



Proba. 



max x max (m) < M 

0<m<n 
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k-M 



where x max (TO) = maxi<i<jv Xi(m), from which one gets 



Proba. 



max x max (m) = M 

0<m<n 



{[W M+ x{n)] N -[W M {n)] N ) 



Finally, using the identity (152^) . one obtains the result given in the text in Eq. §5 
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